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Abstract Imposing an extensional uniformity condition on a non-uni- 
form circuit complexity class C means simply intersecting C with a uni- 
form class C. By contrast, the usual intensional uniformity conditions 
require that a resource- bounded machine be able to exhibit the circuits 
in the circuit family defining C. We say that {C,£) has the Umformity 
Duality Property if the extensionally uniform class C fl £ can be captured 
intensionally by means of adding so-called C-numerical predicates to the 
first-order descriptive complexity apparatus describing the connection 
language of the circuit family defining C. 

This paper exhibits positive instances and negative instances of the Uni- 
formity Duality Property. 
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1 Introduction 

A family {C„}„>i of Boolean circuits is uniform if the way in which Cn+i can 
differ from C„ is restricted. Generally, uniformity is imposed by requiring that 
some form of a resource-bounded constructor on input n be able to fully or 
partially describe C„ (see [8,21,1,5,14] or refer to [24] for an overview). Circuit- 
based language classes can then be compared with classes that are based on a 
finite computing mechanism such as a Turing machine. 

Recall the gist of descriptive complexity. Consider the set of words w e {a, 6}* 
having no h at an even position. This language is described by the F0[<, Even] 
formula -iEli(EvEN(i) A Pb{i)). In such a first-order formula, the variables range 
over positions in w, a predicate for a G {a, b} holds at i iS Wi = a, and a 
numerical predicate, such as the obvious 1-ary Even predicate here, holds at its 
arguments iff these arguments fulfill the specific relation. 

The following viewpoint has emerged [5,3,6] over two decades: when a circuit- 
based language class is characterized using first-order descriptive complexity, the 
circuit uniformity conditions spring up in the logic in the form of restrictions on 
the set of numerical predicates allowed. 
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As a well studied example [12,5], F0[<, +, x] = DLOGTIME-uniform AC° C 
non- uniform AC° = FO[arb], where the latter class is the class of languages 
definable by first-order formulae entitled to arbitrary numerical predicates (we 
use a logic and the set of languages it captures interchangeably when this brings 
no confusion). 

In a related vein but with a different emphasis, Straubing [23] presents a 
beautiful account of the relationship between automata theory, formal logic and 
(non-uniform) circuit complexity. Straubing concludes by expressing the proven 
fact that AC° C ACC° and the celebrated conjectures that AC°[q] C ACC° and 
that ACC" C NC^ as instances of the following conjecture concerning the class 
REG of regular languages: 

Q[arb]nREG= Q[reg]. (1) 

In Straubing's instances, Q is an appropriate set of quantifiers chosen from 
{3} U {3(5''') : < r < g} and reg is the set of rejiular numerical predicates, 
that is, the set of those numerical predicates of arbitrary arity definable in a 
formal sense by finite automata. Wc stress the point of view that intersecting 
{3}[arb] = FO[arb] with REG to form FO[arb]nREG in conjecture (1) amounts 
to imposing uniformity on the non- uniform class FO[arb]. And once again, im- 
posing uniformity has the effect of restricting the numerical predicates: it is a 
proven fact that FO[arb] n REG = FO[reg], and conjecture (1) expresses the 
hope that this phenomenon extends from {3} to other Q, which would determine 
much of the internal structure of NC^. We ask: 

1. Does the duality between uniformity in a circuit-based class and numerical 
predicates in its logical characterization extend beyond NC^? 

2. What would play the role of the regular numerical predicates in such a 
duality? 

3. Could such a duality help understanding classes such as the context-free 
languages in AC°? 

To tackle the first question, we note that intersecting with REG is just one 
out of many possible ways in which one can "impose uniformity" . Indeed, if C is 
any uniform language class, one can replace Q[arb] n REG by Q[arb] fl £ to get 
another uniform subclass of Q[arb]. For example, consider any "formal language 
class" (in the loose terminology used by Langc when discussing language theory 
versus complexity theory [14]), such as the class CFL of context-free languages. 
Undoubtedly, CFL is a uniform class of languages. Therefore, the class Q[arb] fl 
CFL is another uniform class well worth comparing with Q[<, +] or Q[<, x]. 
Of course, FO[arb] fl CFL is none other than the poorly understood class AC° fl 
CFL, and when Q is a quantifier given by some word problem of a nonsolvable 
group, (F0+{(5})[arb] nCFL is the poorly understood class NC^ nCFL alluded 
to 20 years ago [11]. 

The present paper thus considers classes Q[arb] DC for various Q and C. To 
explain its title, we note that the constructor-based approach defines uniform 
classes by specifying their properties: such definitions are intensional definitions. 
By contrast, viewing Q[arb] flREG as a uniform class amounts to an extensional 
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definition, namely one that selects the members of Q[arb] that will collectively 
form the uniform class. In this paper we set up the extensional uniformity frame- 
work and we study classes Q[arb] n £ for Q D {3}. 

Certainly, the uniform class £ will determine the class of numerical predicates 
we have to use when trying to capture Q[arb]n£, as Straubing does for £ = REG, 
as an intensionally uniform class. A contribution of this paper is to provide a 
meaningful definition for the set £'^ of C-numerical predicates. Informally, £^ 
is the set of relations over the natural numbers that are definable in the sense 
of Straubing [23, Section III. 2] by a language over a singleton alphabet drawn 
from £. When £ is REG, the £-numerical predicates are precisely Straubing's 
regular numerical predicates. 

Fix a set Q of monoidal or groupoidal quantifiers in the sense of [5,24,16]. 
(As prototypical examples, the reader unfamiliar with such quantifiers may think 
of the usual existential and universal quantifiers, of Straubing's "there exist r 
modulo q" quantifiers, or of threshold quantifiers such as "there exist a majority" 
or "there exist at least f). We propose the Uniformity Duality Property for 
(Q, £) as a natural generalization of conjecture (1): 

Uniformity Duality Property for (Q, £): 

Q[arb]n£= Q[<,£'^]n£. 

Barrington, Immcrman and Straubing [5] have shown that Q[arb] equals 
AC'^[Q], that is, non-uniform AC'^ with Q gates. Behle and Lange [6] have shown 
that Q[<,£^] equals F0[<, £f*]-uniform AC°[Q], that is, uniform AC°[Q] where 
the direct connection language of the circuit families can be described by means 
of the logic F0[<, £'*^]. Hence the Uniformity Duality Property can be restated 
in circuit complexity-theoretic terms as follows: 

Uniformity Duality Property for (Q, £), 2nd form: 

AC°[Q]n£ = FO[<,£"]-uniform AC°[Q] n £. 

By definition, Q[arb]n£ 3 Q[<, £^]n£. The critical question is whether the 
reverse inclusion holds. Intuitively, the Uniformity Duality Property states that 
the "extensional uniformity induced by intersecting Q[arb] with £" is a strong 
enough restriction imposed on Q[arb] to permit expressing the uniform class 
using the £-numerical predicates, or in other words: the extensional uniformity 
given by intersecting the non-uniform class with £ coincides with the intensional 
uniformity condition given by first-order logic with £-numerical predicates. Fur- 
ther motivation for this definition of Q[<,£^] fl £ is as follows: 

— when constructors serve to define uniform classes, they have access to input 
lengths but not to the inputs themselves; a convenient logical analog to this 
is to use the unary alphabet languages from £ as a basis for defining the 
extra numerical predicates 

— if the closure properties of £ differ from the closure properties of Q[arb], then 
Q[arb] n £ = Q[<, £'^] may fail trivially (this occurs for example when £ = 
CFL and Q = {3} since the non-context-free language {a"b"c" : n > 0} is 
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easily seen to belong to Q[<, C^] by closure under intersection of the latter); 
hence intersecting Q[<,£'^] with £ before comparing it with Q[arb] n £ is 
necessary to obtain a reasonable generalization of Straubing's conjecture for 
classes C that are not Boolean-closed. 

We now state our results, classified, loosely, as foundational observations (F) 
or technical statements (T). We let £ be any class of languages. 

(F) By design, the Uniformity Duality Property for {Q, REG) is precisely Strau- 
bing's conjecture (1), hence its conjectured validity holds the key to the 
internal structure of NC^ . 
(F) The Uniformity Duality Property for ({3}, NEUTRAL) is precisely the Crane 
Beach Conjecture [4] ; here, NEUTRAL is the class of languages L that have 
a neutral letter, i.e., a letter e that may be arbitrarily inserted into or deleted 
from words without changing membership in L. The Crane Beach conjecture, 
stating that any neutral letter language in AC° = FO[arb] can be expressed 
in F0[<], was motivated by attempts to develop a purely automata-theoretic 
proof that Parity, a neutral letter language, is not in AC". The Crane Beach 
Conjecture was ultimately refuted [4], but several of its variants have been 
studied. Thus [4]: 
-the Uniformity DuaHty Property for ({3}, NEUTRAL) fails 
-the Uniformity Duality Property for ({3}, NEUTRAL n REG) holds 
-the Uniformity Duality Property for ({3}, NEUTRAL n {two-letter lan- 
guages}) holds. 

(T) Our definition for the set £^ of £-numerical predicates parallels Straubing's 
definition of regular numerical predicates. For kernel-closed language classes 
C that arc closed under liomoinorphisnis, inverse lionioniorphisms and in- 
tersection with a regular language, we furthermore characterize £^ as the 
set of predicates expressible as one generalized unary £-quantifier applied to 
an FO[<]-formula. (Intuitively, £- numerical predicates are those predicates 
definable in first-order logic with one "oracle call" to a language from £.) 

(T) We characterize the numerical predicates that surround the context-free lan- 
guages: first-order combinations of CFL^ suffice to capture all semilinear 
predicates over N; in particular, FO[<,-h] = FO[DCFL^] = FO[BC(CFL)"], 
where DCFL denotes the deterministic context-free languages and BC(CFL) 
is the Boolean closure of CFL. 

(T) We deduce that, despite the fact that FO[BC(CFL)^] contains all the semi- 
linear relations, the Uniformity Duality Property fails for ({3}, £) in each of 
the following cases: 
-£ = CFL 

— £ = VPL, the "visibly pushdown languages" recently introduced by [2] 

— £ = Boolean closiire of the deterministic context-free languages 

— £ = Boolean closure of the linear context-free languages 

— £ = Boolean closure of the context-free languages. 

The crux of the justifications of these negative results is a proof that the com- 
plement of the "Immerman language" , used in disproving the Crane Beach 
Conjecture, is context-free. 
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(T) At the opposite end of the spectrum, while it is clear that the Uniformity 
Duality Property holds for the set of all languages and any Q, we show that 
the Uniformity Duality Property already holds for (Q, £) whenever Q is a 
set of groupoidal quantifiers and C = NTIME(n)''-; thus it holds for, e. g., the 
rudimentary languages, DSPACE(n), CSL and PSPACE. 

The rest of this paper is organized as follows. Section 2 contains preliminaries. 

Section 3 defines the >C-numcrical predicates and introduces the Uniformity Du- 
ality Property formally. The context-free numerical predicates are investigated 
in Section 4, and the duality property for classes of context-free languages is 
considered in Section 5. Section 6 shows that the duality property holds when 
C is "large enough" . Section 7 concludes with a summary and a discussion. 

2 Preliminaries 

2.1 Complexity Theory 

We assume familiarity with standard notions in formal languages, automata and 
complexity theory. 

When dealing with circuit complexity classes, all references will be made 
to the non-uniform versions unless otherwise stated. Thus AC° refers of the 
Boolean functions computed by constant-depth polynomial-size unbounded-fan- 
in {V, A, -i}-circuits. And DLOGTIME-uniform AC" refers to the set of those 
functions in AC° computable by a circuit family having a direct connection lan- 
guage decidable in time O(logn) on a deterministic Turing machine (cf. [5,24]). 

2.2 First-Order Logic 

Let N be the natural numbers {1,2,3, . . .} and let No = N U {0}. A signature 
(7 is a finite set of relation symbols with fixed arity and constant symbols. A 
cr-structure 21 = (W^,(t^) consists of a set U^, called universe, and a set cr^ 
that contains an interpretation C (U^)^ for each fc-ary relation symbol 
R G a. We fix the interpretations of the "standard" numerical predicates <, +, 
X , etc. to their natural interpretations. By Bit we will denote the binary relation 
{{x,i) G : bit i in the binary representation of a; is 1}. For logics over strings 
with alphabet S, we will use signatures extending az; = {Pa '■ o, G S} and 
identify w = Wi ■ ■ ■ Wn € S* with 21^ = ({1, . . . , n}, a^" }), where » = {i G 
N : u>i = a} for all a e 17. We will not distinguish between a relation symbol 
and its interpretation, when the meaning is clear from the context. 

Let Q be a set of (first-order) quantifiers. We denote by Q[a] the set of first- 
order formulae over a using quantifiers from Q only. The set of all Q[(T]-formulae 
will be referred to as the logic Q[cr]. In case Q = {3} (Q = {3}UQ'}), we will 
also write FO[o-] (FO-|-Q'[cr], respectively). When discussing logics over strings, 
we will omit the relation symbols from as- 

Say that a language L C S* is definable in a logic Q[a] if there exists a 
Q[a-]-formula (p such that 21^ |= (p <^=> w e L for all w G S*, and say that a 
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relation i? C N" is definable by a Q [a] -formula if there exists a formula ip with 
free variables xi, . . . , a;„ that defines R for all sufficiently large initial segment of 
N, i.e., if ({1, ... ,m},(T) 1= (/?(ci, ... ,c„) <s=^ (ci, . . . , c„) € i? for aU m > Cmax, 
where Cmax = max{ci, . . . , c„} [22, Section 3.1]. By abuse of notation, we will 
write L G Q[cr] (or i? e Q[cr]) to express that a language L (a relation i?, resp.) is 
definable by a Q[(7]-formula and use a logic and the set of languages and relations 
it defines interchangeably. 

3 The Uniformity Duality Property 

In order to generalize conjecture (1), we propose Definition 3.2 as a simple gen- 
eralization of the regular numerical predicates defined using V-structures by 
Straubing [23, Section III.2]. 

Definition 3.1. Let V„ = . . . he a nonempty set of variables and let 
S be a finite alphabet. A V„-structurc is a sequence 

w = (ai, l/i) • • • (a™, V^) eiSx *P(V„))* 

such that ai , . . . , a„i £ S and the nonempty sets among Vi, . . . ,Vm form a par- 
tition of Vn (the underscore distinguishes V„-structures from ordinary strings). 
Define Fn ~ {0} x *P(V„). We say that a Vn-structure w is unary if w & F*, 
i. e., if ai ■ ■ ■ Un is defined over the singleton alphabet {0}; in that case, we de- 
fine the kernel of w, kern(w), as the maximal prefix of w that does not end 
with (0,0); to signify that Xi € Va for all 1 < i < n, we also write kern(M;) as 
[xi — ci, . . . ,Xn = Cn] and we let u/^ stand for (ci, . . . , c„). 

We define Struc„ as the language of all such words in F* that are unary 
Vn- structures and let Struc = Un>o Struc„. 

Any set L of unary V„-structures naturally prescribes a relation over the 
natural numbers. Hence, a set of such L prescribes a set of relations, or numerical 
predicates, over N. 

Definition 3.2. Let L C F* be a unary V„-language, that is, a set of unary 
Vn- structures. Let L^ = {w^ : wE L} denote the relation over N" defined by L. 
Then the /2-numerical predicates are defined as 

= {L^'* -.LeC andLQ Struc}. 

We say that a language L is kernel-closed if, for every wGL, kern(M;) G L. We 
further say that a language class C is kernel-closed if, for every L G C there 
exists an L' G C such that = L'^ and L' is kernel- closed. 

Remark 3.3. A unary V„-language L defines a unique numerical relation L^ . 
Conversely, if two unary V„-structures y_ and w_ define the same tuple t;^ = w^, 
then one of the two is obtained from the other by padding on the right with 
the letter (0, 0), i.e., y_ G w(0, 0)* or w; G w(0, 0)*. Hence a numerical relation R 
uniquely determines kern(L) = {kern(M;) : w G L} for any language L such that 
R = L^. 
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We point out the following facts, where we write =g r for the unary predicate 
{x : x = r mod q}. 

Proposition 3.4. Let APER and NEUTRAL denote the set of aperiodic lan- 
guages and the set of languages having a neutral letter respectively. Then 

1. APER'^ = F0[<], 

2. REG'^ = (AC" n REG)N = F0[<, r : < r < g}] = reg, and 

3. neutral"*^ C F0[<]. 

Proof. For part 1, let L e APER with L C Struc„. Define L' = kern(L) • (0, 0)*. 
We claim that L' is aperiodic. To see this, note that for any language K, any 
monoid recognizing L also recognizes LK~^ = {v G F* : vu G L for some u G 
K} [20, Proposition 2.5]. Hence L[(0, 0)*]-i is aperiodic. But kern(L) C L[(0, 0)*]"^ 
So L' equals L[{Q, 0)*]~^ ■ (0, 0)* and is indeed aperiodic. 

Hence there exists a formula G F0[<] such that L{lp) = L' [19]. Let 
ip{x\, . . . ,Xn) be obtained from ip by replacing each P(^Q y^{x), V C Vn, with 
Axisy ^ = ^i^l\xiiv^ 7^ ^i- Then for all w = [a;i = ci, . . . ,a;„ = c„](O,0)-' and 
all m > max{cj : 1 < i < n}, 

2t^|=(^ <^ {{l,...,m},<^) )=^(ci,...,c„). 

Let X and c abbreviate Xi, . . . , a;„ and Ci, . . . , c„, respectively, and let Cmax = 
maxjcj : 1 < i < n}. Then 

c G L'' =^ 3^: [a;i = ci, . . . , = c„](0, 0)^ G L 
=^ [xi = ci, . . . , a;„ = c„] G kern(L) 
^ Vi: [xi = ci,...,a;„ = c„](O,0)* G L' 

=^ Vi: 2l[a,j=ci,...,x„=c„](O,0)« H f 

^Vm>Cinax: ({l,---,m},<,=,c) h V'(^) 

and 

^Vi: [xi =ci,...,x„ = c„](O,0)' ^ X 
^> [xi =Ci,...,Xn = Cn]^ kern(L) 
=^Mi: [xi = ci,...,x„ = c„](O,0)' ^ L' 

=4> Vi: 2l[2,j=ci,...,x„=c„](o,0)* 

({!,..., m},<,=,c) ^ V(^) 

Hence = {c e W : (Vm > c„,ax)[({l, • • • , m}, <, =, c) |= But 
V'(x) G F0[<] and therefore G F0[<]. A 

For the other inclusion, let G N" fl F0[<] via formula i){x). Define ip = 
3xi ■ ■ ■ 3xn {4>{x) A x(^)) ) where 

X{x)= A V (P{0,V){Xi) A^z(^ V {P^o,v'){z) ^ Z = : 
i<i<n ye<p(Vn), v'e^KVn), 
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The purpose of x(£') is to bind the variables in x to their respective values in a 
corresponding V„-structure: for a string w~ (0, Vi) • • • (0, Vm) € F* and a tuple 
c, 2tu^ 1= x{c) holds iff m > max{cj : 1 < i < n}, u; is a unary V„-structure, and 
for all 1 < i < n, Xi £ Vci- Alike the above equivalence, we obtain 

cgR Vm > Cmax: ({1, • • ■,m}, <,=,c) \= ip{x) 

<S=^ Vi: 2t[a,j=ci,...,x„=c„](O,0)* H 

Let Z/ be the unary V„-language {w G Struc„ : 21^ |= y}. Then R = = 

{L{(p))^ with LP e F0[<]. Tims Af^ is finite and aperiodic and ^ APER'*'. 

Part 2 follows analogously from [23, Theorems III. 1.1 and III. 2.1]. 

For Part 3, let R G NEUTRAL"**. Then R = for some neutral letter 
language L C Struc„. Assume that the neutral letter of L is (0,0), otherwise 
L C Struc„ implies that L is empty. Since we can insert or delete (0, 0) in any 
word at will, L is fully determined by the (0, 0)-free words it contains, that is, 

L= y (o,0r(o,yi)(o,0r(o,i/2)---(o,0r(o,Vfe)(o,0r- 

{Vi,V2,---,Vk) partitions V„ 
and {0,Vi){0,V2)--iO,Vk)eL 

This is a finite union of regular aperiodic languages. Hence L is regular aperiodic, 
and is in F0[<] by Part 1. 

Having discussed the ^-numerical predicates, we can state the property ex- 
pressing the dual facets of uniformity, namely, intersecting with an a priori uni- 
form class on the one hand, and adding the corresponding numerical predicates 
to first-order logics on the other. 

Property 3.5 (Uniformity Duality for (Q,£)). Let Q be a set of quantifiers 
and let C he a language class, then 

Q[arb] n/: = Q[<,/:^] nr. 

As Q[arb] = ACO[Q] [5] and Q[<,£"] = FO[<,£^]-uniform ACO[Q] [6], the 
above property equivalently states that 

AC"[Q] n £ = F0[<, £"^]-uniform AC°[Q] n C. 

As a consequence of Proposition 3.4(1-2), the Uniformity Duality Property 

is equivalent to the instances of the Straubing conjectures obtained by setting 
Q and C as we expect, for example Q C {3} U {3(9''') : < r < g} and £ = REG 
yield exactly (1). Similarly, as a consequence of Proposition 3.4(3), the Unifor- 
mity Duality Property is equivalent to the Crane Beach Conjecture if F0[<] C C. 
Property 3.5 is thus false when Q = {3} and C is the set NEUTRAL of all neu- 
tral letter languages. For some other classes, the Crane Beach Conjecture and 
thus Property 3.5 hold: consider for example the case C = REG n NEUTRAL 
[4], or the case Q = {3} and £ C NEUTRAL n F0[+]. Accordingly the Unifor- 
mity Duality Property both generalizes the conjectures of Straubing et al. and 
captures the intuition underlying the Crane Beach Conjecture. Encouraged by 
this unification, we will take a closer look at the Uniformity Duality in the case 
of first-order logic and context-free languages in the next section. 
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In the rest of this section, we present an alternative characterization of 
using FO[<]-transformations and unary Lindstrom quantifiers. This is further 
justification for our definition of £-numerical predicates. The reader unfamihar 
with this topic may skip to the end of Section 3. 



Digression: Numerical Predicates and Generalized Quantifiers 

Generalized or Lindstrom quantifiers provide a very general yet coherent ap- 
proach to extending the descriptive complexity of first-order logics [17]. Since 
we only deal with unary Lindstrom quantifiers over strings, we will restrict our 
definition to this case. 

Definition 3.6. Let A = {ai,...,a4} be an alphabet, ipi, . . . ,(pt-i be F0[<]- 
formulae, each with k + 1 free variables xi,X2, ■ ■ ■ ,Xk,y, and let x abbreviate 
Xi,X2, ■ ■ ■ ,Xk- Further, let Struct(c7) denote the set of finite structures 21 = 
(W^, (7^) over a. Then <^i, . . . , ipt-i define an FO[<]-transformation 

[<fii{x),...,ipt-i{x)]: STRUCT({<,a;i,...,a;fe}) ^ A* 

as follows: Let 21 G STRUCT({<,a;i, . . . ,a;fe}), xf = Ci e U^, I < i < k, 
s = \U^\, then [ip\{x), . . . , yt_i(:E)](2l) =vi---Vs € A* , where 



Vi 



ai, «/2t 1= <^i(ci, . . . ,Cfc,i), 

if^ \= (^j{ci,...,Ck,i) A ACi ""/'((ci, • • • , Cfe, i), 1 <j<t, 
at, i/2l 1= A*=l -'^i(ci, • • • ,Cfe,i). 



A language L C A* and an FO[<]-transformation [ipi{x), . . . , (pt-i{x)] now nat- 
urally define a (unary) Lindstrom quantifier Q^" via 

^\=QTy[Mx,y),---,v^t-i{s,y)] ^ [.^i (f ),..., (21) e l. 

Finally, the set of relations definable by formulae Q^y[(p-i{x,y), . . . ,ipt-i{x,y)], 
where L e C and ipi,..., ipt_i e F0[<], will be denoted by Q^FO[<]. 

The notation [ipi{x), . . . , ipt-i{x)] is chosen to distinguish the variables in 
X from y; the variables in x are interpreted by 2t whereas y is utilized in the 
transformation. 

Theorem 3.7. Let C be a kernel-closed language class which is closed under ho- 
momorphisms, inverse homomorphisms and intersection with regular languages, 
then = Q""FO[<]; that is, the C-numerical predicates correspond to the 
predicates definable using a unary Lindstrom quantifier over C and an F0[<]- 
trans formation. 

Proof. For the inclusion from left to right, consider a relation L^ C N" in and 

let V„ — {xi, . . . ,Xn}- Define the FO[<]-transformation [(pi{x), . . . ,(p2n-i{x)] 
from N" to unary V-structurcs as follows. For 1 < i < 2", let 

Vi= /\iy = Xj) A /\ (y ^ Xj), 
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where Vi denotes the ith subset of V in the natural subset ordering, and as- 
sociate the letter (0, V^) with ipi. Let c = (ci,...c„) G N" and denote by 21 
the structure ({1, . . . ,l},a) with I = maxjjci} and {<, a;i, . . . , C a. Then 
[(fi{x), . . . , (f2'^-i{x)] maps 21 to the unary V„-structure Vi ■ ■ - vi. Then 

cG => 3j3w = [xi = Ci, . . . , Xn = Cn]: w{0, %y G L 

[a;i = ci , . . . , x„ = c„] G L (since L is kernel-closed) 
^ ),..., </.2"-i(.^)] (21) eL 

and the reverse implications hold for any unary V„-language L. 

For the opposite inclusion, let i? C N" n Q™FO[<] via the transformation 
[ipi{x), . . . , ipk-i{x)] and the language L C A* H £, \ A\ = k. Wc must exhibit a 
unary V„-language A G £ such that R = A^, i. e., R = ft,(kern(^)), where 

h : {kern(w) : u; is a unary V„-structure} N" 

[xi = Ci, . . . ,Xn = Cn] ^ (ci , . . . ,C„). 

Our proof is similar to the proof of Nivat's Theorem, see [16, Theorem 2.4]. 

Define B C (r„ x A)* to consist of all words (^j) • • • (fj) such that u = 
Hi' ' 'lii is the kernel of a unary Vn-structurc and [(fi{x), .... (fk-i{x)] maps 
to V = vi - ■ -vi- Define the length-preserving homomorphisms /: (-) ^-^ u and 
g: (f) H^v. Then i?=(/io kern o/)(B n g-'^{L)). 

Wc claim that B is regular. Theorem 3.7 then follows from the closure prop- 
erties of C by setting A = f{B Ci g~^{L)) 

For 1 < i < fc — 1, let (^^(y) be defined as (^^(y) = 3xi ■ ■ ■ 3xn{<fi{x,y)A'i(){x)f\ 
7r(.x)), where ^j{x) and ■n{x) bind the variables in x to their respective values in a 
corresponding unary V„-structure and asserts that each variable occurs exactly 
once; that is, 



V'(^)= A V V ^(<«.-))(2^o, 

l<i<n de/1 Ve^Vn), 
T^{x)= A ^^(( V V P^(0.vy^iz)) ^ Z = Xiy 



l<i<n deA Ve^KV-n), 

xiev 

Now let Xj{z) = Vye'p(v„) ^ ^ j ^ k, where dj is the jth letter in A. 

Then a string (^i) • • • (^') e {Fn x A)* is in B if and only if m = Mi ■ • • is the 
kernel of a unary V^-structure and 

(fe-l i-l k-1 

/\{{^i{z)A/\^<f\{z))^Xi{z))A{;\ ^ Xkiz) 

i=l 1=1 1=1 

holds on z = m^, where the empty conjunction is defined to be true. Concluding, 
B e F0[<] c REG. 

We stress that the above result provides a logical characterization of the £- 
numerical predicates for all kernel-closed classes C forming a cone, viz. a class 
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of languages C closed under homomorphisms, inverse homomorphisms and in- 
tersection with regular languages [10]. As the closure under these operations is 
equivalent to the closure under rational transductions (i.e., transductions per- 
formed by finite automata [7]), we obtain: 

Corollary 3.8. Let C be kernel-closed and closed under rational transductions, 
then £N = Qunpo[<]. 



4 Chciracterizing the Context-Free Numerical Predicates 

In order to examine whether the Uniformity Duality Property for first-ordor 
logics holds in the case of context-free languages, we first need to consider the 
counterpart of the regular numerical predicates, that is, CFL^. Our results in 
this section will relate CFL^ to addition w. r.t. to first-order combinations, and 
are based upon a result by Ginsburg [9]. Ginsburg showed that the number 
of repetitions per fragment in bounded context-free languages corresponds to 
a subset of the semilinear sets. For a start, note that addition is definable in 
DCFL^*. 

Lemma 4.1. Addition is definable in DCFL^. 

Proof. Let V3 = {xi,X2,xz\ and Lj^ be a unary Vn-language defining addition, 
that is, = {{xi,X2-iX^) : Xi-\-X2 = x^}. Then L_|_ is recognized by the following 
deterministic PDA P = (Q, r„, Zi, 5, go, -L, {gacc}), where Q = {go^^acc}, ^ = 
{_L, 0} and S is defined as follows: 

(5(zo,(O,0) ,7)=(2o,07) S{zy ,{0,tl}) ,7)^(2^,7) 

5(zo,(0, {a;}),7) =(2^,07) 6{zy ,{0,{x}), j) = {z^y , j) 

5(^0, (0,M), 7) ={zy ,07) S{z^y,{0,(D) , 0) ={z^y, e) 

5(z„(O,0) ,7)=(0, ,07) 5{z,y,{0,{z}),0) ={z, , e) 

<>{Zx,iO,{y})n) = {Zxy, 7) <5(2z , £ ,-L) =(2acc,-L) 

where 7 £ Z\. 

Next, we restate the result of Ginsburg in order to prepare ground for the 
examination of the context-free numerical predicates. In the following, let w* 
abbreviate {w}* and say that a language L C S* is bounded if there exists an 
n e N and wi, . . . , w)„ e such that L Cw^ ■ ■ ■ ui^. 

Definition 4.2. A set R C Ng is stratified if 

1. each element in R has at most two non-zero coordinates, 

2. there are no integers i,j,k, I and x — (xi, . . . ,Xn),x' = {x'l, . . . ,x'^) in R 
such that l<i<j<k<l<n and Xix'jXkx'i 7^ 0. 

Moreover, a set S C N" is said to be stratified semilinear if it is expressible 

as a finite union of linear sets, each with a stratified set of periods; that is, 

5 = [JZii^io + Ej=i k-dij :ke No}, where G N", a^- € Ng, 1 < j < m, 
1 < i <m, and each Pi = {ctij : 1 < j <ni} is stratified. 
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Theorem 4.3 ([9, Theorem 5.4.2]). Let E be an alphabet and L C ■ ■ - w^ 
be bounded by w\, . . . ,Wn S . Then L is context-free if and only if the set 

E{L) = {(ei, e„) e N^ e L} 

is a stratified semilinear set. 

Theorem 4.3 relates the bounded context-free languages to a strict subset of 
the semilinear sets. The semilinear sets are exactly those sets definable by F0[+]- 
formulac. There are however sets in F0[+] that are undefinable in CFL'^: e. g., if 
R= {{x, 2x, 3x) : X e N} was definable in CFL^ then {a"b"c" : n e N} G CFL. 
Hence, F0[-|-] can not be captured by CFL*^ alone. Yet, addition is definable in 
CFL'*', therefore we will in the following investigate the relationship between first- 
order logic with addition, F0[-|-], and the Boolean closure of CFL, BC(CFL). 

Lemma 4.4. Let ii C N" and let R = L^ for language L. Then L is bounded. 

Proof Let C N" and L such that R = L^. Let L C T*, where V„ = 
{xi, . . . ,Xn} is a set of variables. Every unary V„-structure u; e L of a given 
order type t £ T belongs to the language 

Lt = (0, 0)* (0, Vi) (0, 0)* (0, F2) • • • (0, Vk) (0, 0)* , 

where (Vi, V2, . . . , V/c) is an ordered partition of V„. The number p of such ordered 
partitions depends on n; in particular, p is finite. Hence L C L*L2 ■ ■ ■ L^^, where 
ti,t2, ■ ■ . ,tp exhaust the possible types. Since each letter (0, Vi) belongs to the 

/ \ 2n+l 

language UvcVn (0. ^T, it follows that L C Yli<i<p [UvcVn (0' j 
Lemma 4.5. All bounded context-free languages are definable in F0[-|-]. 

Proof. Since L is bounded, there exist wi, . . . ,Wn G such that L C . . .w*. 
By Theorem 4.3, it holds that the set 

i;(L) = {(ei,...,e„) :<...<" eL} 

is stratified semilinear. It follows by semilinearity alone that, for e = (ei, . . . , e„), 
EiL) = U" i{e : e = a,o + Ejii k ■ dij,k G No}, where aio S N", G for 
all 1 < j < rii, 1 < « < m. As a consequence, L is defined via the formula (f = 

^Wl,. ..,Wn 

(ei,...,e„) A V3iin(ei,...,e„)), where 

^Wl,. ..,Wn 

(ei, . . . ,e„) 

checks whether the input is of the form wl'^ ■ ■ ■ and 

(/5iin(ei, . . . , e„) = Y (Bai^i ■ ■ ■ 3aj,„. f\ (e/c = aiofc + ^1 • 
i=l fc=i j=i 

In particular, <^^^i...^«» G F0[-|-]. Concluding, L G F0[-|-]. 

Lemma 4.6. Let V„ = . . . and let L be a unary Vn-language. Then 
L G F0[+] implies L^ G F0[+]. 
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Proof. Since L e F0[+], there exists a formula G F0[+] such that for all 

({l,...,m},<,+,P) |=(p ^ weL, 

where P ~ {P(o,y) '■ V G ^^Cl^n)} and P(o.y)(z) is true if and only ii z € V, 
ior 1 < z < m. Let y = {yi, . . . , yn)- We construct the formula ^p'{y) from {p by 
replacing, for each variable z and each V e ^(V„), the predicate P(o,y)(-2;) with 
^yiev z = yi^ Ny^^v ^ + Vi- Then 

{{l,...,m}.<,+)\=v'{y) ^ ({l,...,m},<,+,P) h<P- 
Hence the formula witnesses S F0[+]. 

Theorem 4.7. BC(CFL'^) C BC(CFL)'^ C F0[+]. 

Froo/, The inclusion BC(CFL™) C BC(CFL)'^ follows from (1.) the fact that for 
every unary Vn-language L G CFL, there exists an equivalent kernel-closed unary 
V„-language L' e CFL; (2.) n = (Li n and y = (Li U La)^ for 
kernel-closed languages Li,L2; and (3.) the observation that for a kernel-closed 
L e CFL with R = L", the language L' = L{0, 0)* e CFL also verifies R = L'^, 
so that R=(l7n Struc„)N e BCCCFL)"^. 

It remains to show that BC(CFL)^ C F0[+]. Denote by eoCFL set of lan- 
guages whose complement is in CFL, i.e., coCFL = {L : L E CFL}. Let 
R C BC(CFL)'*' n N" and V„ = {xi,...,Xn}. Further, let L be some unary 
V„-language such that R = L^; w. 1. o. g. L = UHi 0^=1 where Lij G 
CFL U coCFL. It holds that L is a unary Vn-language if and only if 0^=1 ^ij is a 
unary V„-language for all 1 < i < m. Hence, we need to show that Lf is F0[+]- 
definable, for any unary V„-language = H^li Lij with Lij G CFL U coCFL. 

Note that Struc„ is bounded and definable in F0[<] C REG. Then Li = 
[Lii n Struc„) n • • • n [Lik n Struc„) and each (L^ n Struc„), 1 < j < A;, is 
bounded (Lemma 4.4). We have to distinguish the following two cases: 

Case 1: G CFL. Then Lij n Struc„ G CFL. Thus Lemma 4.5 implies Lj n 
Struc„ G F0[+]. 

Case 2: Lij G coCFL. As Lij n Struc„ is bounded, it can be written as Lij n 
Struc„ = {w'l^ ■ ■ ■ w'^" : (ei, . . . , e„) G X} for words Wi,. . . ,Wn and some 
relation X C Nq . Hence, 

Lij n Struc„ = Lij U Struc„ = {Lij n Struc„) U Struc„ 

where Lij O Struc„ is the intersection of a context-free language with a reg- 
ular language and therefore context-free. Further note that Lij fl Struc„ = 
{wl'^-'-w^" G Struc„ : (ei,...,e„) ^ X} is bounded. From Lemma 4.5 
it now follows that Lij Ci Struc„ G F0[-|-]. Thus, finally, Lij (1 Struc„ = 

(X^ n Struc„) U Struc„ G F0[-|-]. 

Summarizing, Li = {Ln H Struc„) fl • • • n {Lik Struc„) is definable in F0[+] 
using the conjunction of the defining formulae. Since Li is a unary V„-language 
by assumption. Lemma 4.6 implies the claim. 
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That is, the relations definable in the Boolean closure of the context-free 
unary 14-languages are captured by F0[+]. Hence, FO[BC(CFL)'^] C F0[+]. 
Now Lemma 4.1 yields the following corollary. 

Corollary 4.8. FOpCFL^] = FOfCFL'^] = FO[BC(CFL)'^] = F0[+]. 

We note that in particular, for any k gN, the inclusion {f])^ CFL)^ C F0[+] 

holds, where H^, CFL denotes the languages definable as the intersection of < fc 
context-free languages: this is deduced from embedding numerical predicates 
derived from the infinite hierarchy of context-free languages by Liu and Weiner 
into CFL^ [18]. Hence, 

CFL« C . . . C (n, GFLf C (n,+i CFL)N C • • • C CFL)" C F0[+]. 

Unfortunately, we could neither prove nor refute F0[-|-] C BC(CFL)^. The 
difiiculty in comparing F0[-|-] and BC(CFL)'*' comes to some extent from the 
restriction on the syntactic representation of tuples in CFL; viz., context-free 
languages may only compare distances between variables, whereas the tuples 
defined by unary Vn-languages count positions from the beginning of a word. 
This difference matters only for language classes that are subject to similar 
restrictions as the context-free languages (e.g., the regular languages are not 
capable of counting, the context-sensitive languages have the ability to convert 
between these two representations). To account for this special behavior, we will 
render precisely CFL^ in Theorem 4.9. 

But there is more to be taken into account. Consider, e. g., the relation R = 
{{x,x,x) : X € N}. R is clearly definable in CFL", yet the set E{L) of the 
defining language L, = R. is not stratified scmilincar. Specifically, duplicate 
variables and permutations of the variables do not increase the complexity of a 
unary Vn-language L but affect L^. 

Let t be an order type oi x = {xi. . . . , Xn) and say that a relation R C N" 
has order type t if, for all x G R, x has order type t. For x of order type t, 
let x' = {x[, . . . ,x'^), m < n, denote the variables in x with mutually distinct 
values and let tt/ denote a permutation such that < ^7rt(i+i)' 1 ^ * < 'Ti- 

We define functions sort: «P(N") -> <P(N'") and rfijff :V(N") ^ q3(Ng) as 

sort{R) = {TTt{x') : X € R has order type t}, 

i 

diff{R) = {{xi)i<i<n : (y^Xj) G r}. 

K. \^ — / l<i<n J 

The function sort rearranges the components of R in an ascending order and 
eliminates duphcates, whereas diff transforms a tuple {xi, . . . ,Xn) with xi < 
X2 < ■ ■ ■ < Xn into (xi, ^2 — xi, — X2 — .Ti, . . . , — Y^^=i Xi), a, representation 
more "suitable" to CFL (cf. E{L) in Theorem 4.3). 

Theorem 4.9. Let R C N". R G CFL" if and only if there exists a partition 
R = Ri D ■ ■ ■ U Rk such that each diff (sort (Ri)) , 1 < i < k, is a stratified 
semilinear set. 
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Proof. For the direction from left to right, let € CFU\ C N" and let 
ti, . . . ,tp exhaust the possible order types of x = (xi, . . . , a;„). As CFL is closed 
under intersection with regular languages, L can be partitioned into context-free 
languages L — Lt^ U • • • U Lt^ such that Lf. has order type ti, 1 < i < p- 

Fix any Lt^. By design of sort, it holds that xi < X2 < • ■ ■ < x„i for all 
{xi, . . . ,x„i) e sort{Lf,), hence diff [sort{L^.)) £ N™ is defined. Since x has 

order type ti for all x G Lt-, sort{L^.) — {(p(Lt-)f^ for a homomorphism (p 
substituting the characters {0,V), V ^ %, with appropriate (0, V). We thus 
obtain that sort(Lf.) G CFL^. Say = sort{L^.) for the context-free unary Vn- 
language A, then Theorem 4.3 implies that the set E{A) is a stratified semilinear 
set. Consider the finite state transducer T in Figure 1. T defines the rational 



(O,0)/ai (O,0)/a2 (O,0)/a^ (O,0)/e 
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(0,{n-i})/ai ^ C .J (0,{a'2} ) / a2 (0,{a:^- i})/a^- ^ < ^ {0,{x„})/a„ 



Figure 1. Transducer T 



transduction ip: F* —>■ {ai, . . . , a™, e}*, ip{w) — vi ■ ■ -Vg, where w — Wi - ■ -w, 
\w\ = s, is the given unary V„-structure and, for l<i<s,l<j<m, 




a,j, if • • '^ils = (0, 0)'(O, {xj})u for some I £ No,u G F*, 
e, if u)j • • ■Wi{0, 0)' = w for some I e Nq. 



We claim that diff (^sort{Lf.)) = E(^ip{A)Y The claim concludes the direction 
from left to right, since ip{A) G CFL due to the closure of CFL under rational 
transductions. 

Claim. Let A G CFL such that A^ = sort{Ll), then diff (sortiLf.)) =E{i:{A)). 

To prove the claim, let V„ — {xi, . . . ,Xm} and let A G CFL be a unary 
Vn-language such that A^ = sort{Lf.). Fix an arbitrary c ~ (ci, . . . , c„i) G A^ 
and choose w£ A such that = c. Then ci < C2 < • • • < c,n and 

w = (0, 0)^1-1(0, {a;i})(0, 9y--''-\0, {x^}) ■ ■ ■ 

(O,0)='"-^"^'^--i(O,{x„})(O,0)^ 

where d — \w\ — c^n- Hence il^{w) = a^^a2^~'^^ • ■ • a^r ^'e*^ and 

m — 1 

E{lp{{w})) = |(ci,C2 - Cl, . . . ,Cm - ^ Cj^j. 

1=1 

On the other hand, diff{{c}) = {(ci,C2 — ci,...,Cm — X^ill^^ '^O}- Thus, for 
every c G sori(LN), i;(V'({c})) = dijfl{c}) and diff{sort{Ll)) = 
This implies the claim and concludes the direction from left to right. 
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For the direction from right to left, it suffices to show that Ri € CFL for 
each 1 < i < k. By assumption, diff (^sort{Ri)) C N™ is a stratified scmilin- 
ear set. Thence there exists a bounded language A e CFL such that E{A) = 
diff [sort{Ri)) . Let A w. 1. o. g. be bounded by ai, . . . , a™ G 1^, i. e., A C a^ • • • aj^. 
Define the rational transduction x- {^i, . . . , a„i}* F* as x(^) = Vi'"11sj 
where w = Wi ■ ■ ■ Wg, = •''^ 'A'aA, for 1 < i < s, 1 < j < m, 

{(0, {xj}), if i = s and Wi = aj or i < s and Wi = a,j ^ Wi+i, 
(0, 0) if i < s and Wi = Wi+i. 

Note that {tp o x) {w) = w for all w G A. An argument analogous to the above 
claim thus yields E{A) = ^(V(x(^))) = diff{{x{A)f) and soH(Ri) = ix{A)f- 
In particular, ix{A)f G CFL^. 

Moreover, Ci < C2 < • • • < Cm for all c = (ci, . . . ,Cm) € (x(^))'^, thus there 
exists a function tt : {1, . . . , n} ^ {1, . . . , m}, n>m, such that 

{iXn{l),---,X^{n)) ■ {xi,...,Xm) S (x(A))'^} = Ri- 

Let the homomorphism cp: F* ^ F* mimic the above transformation by replac- 
ing (0, {a;^}) with (0, Vi), where Vi = {xj : 1 < j < n,'jT{j) = i}, 1 < i < m. 
Then Ri = ((</> o x)(^))'' € CFL". 

5 The Uniformity Duality and Context-Free Languages 

Due to the previous section, we may express the Uniformity Duality Property for 

context-free languages using Corollary 4.8 in the following more intuitive way: let 
Q = {3} and £ be such that FO[C^] = F0[<, +] (e. g., DCFL C £ C BC(CFL)), 
then the Uniformity Duality Property for ({3}, C) is equivalent to 

FO[arb]n£ = FO[<,+]n£. (2) 

We will hence examine whether (2) holds, and see that this is not the case. 
For a binary word u = Un-iUn-2 ■ ■ - uq € {0, 1}*, we write u for the integer 

u„_i2"~^ H h 2ui + uq. Recall the Immerman language Lj C {0, 1, a}*, that 

is, the language consisting of all words of the form 

xi ax2 a • ■ • a.X2", 

where Xi e {0, 1}", Xi + 1 — x^+i, 1 < i < 2", and xi — 0", = 1". For exam- 
ple, OOaOlalOall G L/ and OOOaOOlaOlOaOllalOOalOlallOalll G L/. We prove 
that despite its definition involving arithmetic, Lj is simply the complement of 
a context-free language. 

Lemma 5.1. The com.plement Fj of the ImmerrriMn language is context-free. 
Proof. Let S = {0, 1, a}. Throughout this proof, u and v stand for binary words. 
Claim. Let u = ti„_iu„_2 • • • wo, v = Vn-iVn^2 ■ ■ 'Vq and uq ^ vq. Then 

w + 1 = u (mod 2") (3) 
iff none of the words UiUqViVq,U2UiV2Vi, . . . , Un-iUn-2Vn-iVn-2 belongs to 

{0010, 0011, 0100, 0111, 1001, 1000, 1110, 1101}. (4) 
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The claim implies that the following language is context-free: 

A = {xuAvy : X & E*,y e S* , \u\ = \v\,u + l ^ v (mod 2l"l)}. 

Note that {xua.vy : x e (X'*a)*, y e (aS*)*, \u\ = 1 ^ v (mod 2l"l)} c 

A C Li. Hence A does not catch all the words in L/, but it catches all the 
words of the correct "form" which violate the successor condition (modulo 2'"'). 
For example, A catches OOaOlallalla, but it does not catch a nor OalaOal 
nor OlalOallaOO nor OOaOlalOallaOOaOl nor OOaOlOOlalOall. We complete the 
proof by expressing Lj as follows: 

17 = ^ U a* U r^aO^aZ"* U Z'*al*aZ'* U 

{0, lyiS* U r'^0{0, l}* U U|„|^|„|(^*a)*«az;(ai:*)\ ^ ' 

Now we prove the claim. The direction from left to right is easy: if any word 
UiUi^iViVi^i belongs to the forbidden set (4) then (3) trivially fails. For the 
converse, we must prove that if no forbidden word occurs then (3) holds. We 
prove this by induction on the common length n of the words u and v. When 
n ~ I, uq and vq =/= uq are successors modulo 2. So suppose that n > 1 and that 
none of the forbidden words occurs in the pair (u„w, Vnv), where w„, u„ € {0, 1}, 
u = Un-i ■ ■ ■ uiuo and v = Vn-i ■ ■ ■ viVo and uq 7^ vq- Then by induction, u + 1 = 
V (mod 2"). There are four cases to be treated: 

Case 1: = ('n-i = 0. Then no overflow into Un occurs when 1 is added to 

u to obtain v. Since the forbidden words leave only u„ = Vn as possibilities, 
+ 1 = u^. 
Case 2: Un-i = Vn-i = 1- Analogous. 
Case 3: = ^ fn-i = 1- Analogous. 

Case 4: 1 = ^ Vn-i = 0. This is an interesting case. The fact that u+1 = v 
(mod 2") implies that u = 1" and v = 0". Now the forbidden words imply 
Un Vn- This means that either u„w = 01" and VnV = 10", or w„u = 1"+^ 
and VnV = 0"+^. In the former case, + 1 = v^, and in the latter case, 
+ 1 = v^ (mod 2"+^). 

For a language L C let Neutral(L) denote L supplemented with a neutral 
letter e ^ S, i.e., Neutral(L) consists of all words in L with possibly arbitrary 
repeated insertions of the neutral letter. 

Theorem 5.2. FO[arb] n BC(CFL) D F0[<, +] n BC(CFL). 

Proof. From the Crane Beach Conjecture by Barrington et al. [4, Lemma 5.4], 

we know that Neutral(L/) £ FO[arb] \ F0[<;+] . So we are done if we can 
show Neutral(L/) G BC(CFL). We proved in Lemma 5.1 that XT = {0, l,a}* \ 
Li is context-free. Therefore Neutral(Z/7) is context-free. Now, for any We € 
{0, 1, a, c}*, let w G {0, 1, a}* be the word obtained by deleting all occurrences 
of the neutral letter e from w^. Then for any We, 

Wo G Neutral(L/) <^=> w G Lj <^=> w ^ Lj <=^ Wc ^ Neutral(L/). 

In other words, Neutral(L/) = {0, 1, a, e}*\Neutral(iy/) = Neutral(iy/) and thus 
Neutral(L/) G BC(CFL). 
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Theorem 5.2 implies that the Uniformity DuaHty Property fails for Q = {3} 
and £ = BC(CFL), since FO[<,BC(CFL)'*'] = F0[<,+]. Yet, it even provides 
a witness for the failure of the duality property in the case of £ = CFL, as the 
context-free language Neutral(L/) lies in FO[arb] \ F0[<, +]. We will state this 
result as a corollary further below. For now, consider the modified Immerman 
language Rj defined as Lj except that the successive binary words are reversed 
in alternance, i.e., 

Ri = {.. . ,000a(001)^a010a(011)'"al00a(101)'"all0a(lll)^, . . .}. 

Ri is the intersection of two deterministic context-free languages. Even more, the 
argument in Lemma 5.1 can actually be extended to prove that the complement 
of Ri is a linear CFL. Hence, 

Theorem 5.3. 1. FO[arb] n BC(DCFL) D F0[<, +] n BC(DCFL). 
2. FO[arb] n BC(LinCFL) D F0[<, +] n BC(LinCFL). 

Proof. For the first claim, observe that Ncutral(i?/) ^ F0[<, because F0[<, +] 
has the Crane Beach Property and Rj ^ REG. On the other hand, Neutral(i?/) S 
FO[arb]; and since Ri G BC(DCFL), Ncutral(i?/) e BC(DCFL). 

For the second claim, Rj can be expressed analogously to Lj by substituting 
A with an appropriate set A' in (5). Further, for uav G S* with binary words 
u and V, the conditions + 1 ^ v (mod 2l"') and u + 1 ^ (mod 2l"l) can 
be checked by a linear CFL. Since the linear context-free languages are closed 
under finite union, the claim follows. 

The role of neutral letters in the above theorems suggests taking a closer look 
at Neutral(CFL). As the Uniformity Duality Property for ({3}, Neutral(CFL)) 
would have it, all neutral-letter context-free languages in AC'' would be regular 
and aperiodic. This is, however, not the case as witnessed by Neutral(i/). Hence, 

Corollary 5.4. In the case of Q = {3}, the Uniformity Duality Property fails 
in all of the following cases. 

1. C = CFL, 

2. /: = BC(CFL), 

3. C = BC(DCFL), 

4. C = BC(LinCFL), 

5. £ = Neutral(CFL). 

Remark 5.5. The class VPL of visibly pushdown languages [2] has gained promi- 
nence recently bccaiisc it shares with REG many useful properties. But despite 
having access to a stack, the VPL- numerical predicates coincide with REG^, for 
each word may only contain constantly many characters different from (0,0). 
It follows that the Uniformity Duality Property fails for VPL and first-order 
quantifiers: consider, e.g., L = {a"b" : n > 0} G FO[arb] n (VPL \ REG) then 
L G FO[arb] n VPL but L ^ F0[<, VPL'^] n VPL. 
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6 The Duality in Higher Classes 



We have seen that the context-free languages do not exhibit our conjectured 
Uniformity Duality. In this section we will show that the Uniformity Duality 
Property holds if the extensional uniformity condition imposed by intersecting 
with C is quite loose, in other words, if the language class C is powerful. 
Recall the notion of non-uniformity introduced by Karp and Lipton [13]. 

Definition 6.1. For a complexity class C, denote by £/poly the class C with 
polynomial advice. That is, £/poly is the class of all languages L such that, for 
each L, there is a function f: N — »■ {0, 1}* with 

2. Lf = {{x,f{\x\)):x&L}&C, 

where p is a polynomial depending on C. Without loss of generality, we will 
assume \f{x)\ — \x\^ for some fc € N. 

Note that, using the above notation, DLOGTIME-uniform AC°/poly = AC°. 
As we further need to make the advice strings accessible in a logic, we define the 
following predicates. 

Following [5], we say that a Lindstrom quantifier Ql is groupoidal, if X e 
CFL. 

Definition 6.2. Let Q be any set of groupoidal quantifiers. Further, let L € 
DLOGTIME-uniform AC°[Q]/poly and let f be the function for which e 
DLOGTIME-wm/orm AC°[Q]. Let r = 2kl + 1, where k and I are chosen such 
that the circuit family recognizing L in DLOGTIME-Mni/orm AC° has size n} 
and \f{x)\ = \x\''. We define Advice{ q e FO+Q[arb] to be the ternary relation 

Advice{ q = {{i,n,n^) : bit i of f(n) equals 1}, 

and denote the set of all relations Advice;^^ q, for L € C, by Advice^^q. 

The intention of Advice{^ q is to encode the advice string as a numerical 
relation. A point in this definition that will become clear later is the third argu- 
ment of the Advice{ g-predicate; it will pad words in the corresponding unary 
V„-language to the length of the advice string. This padding will be required for 
Theorem 6.4. 

Theorem 6.3. Let C be a language class and Q be a set of groupoidal quantifiers. 
Then the Uniformity Duality Property for ({3} U Q, C) holds if Bit e £^ and 
Advice£,q e C^. 

Proof. Let £ be a language class that satisfies the requirements of the claim. We 
have to show that FO+Q[arb] f^ C = YO+Q[<,C^^] n C. 

The inclusion from right to left is trivial. For the other direction, let L € 
FO-|-Q[arb] fl C Without loss of generality, we assume L C {0,1}*. In [5], 
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Barrington et al. state that FO+Q[arb] = FO+Q [Bit] /poly = DLOGTIME- 
uniform AC°[Q]/poly for arbitrary sets Q of monoidal quantifiers, but what is 
needed in fact is only the existence of a neutral clement, which is given in our 
case. There hence exists a polynomial p(n) = n'^, fc S N, a function / with 
\f{x)\ =p(\x\), and a DLOGTIME-uniform circuit family {C„i}m>o that recog- 
nizes . From {Cm}m>Qi we construct a formula ip E FO+Q[<,£'^], essentially 
replacing the advice input gates with the relation Advice{ q. 

Let Xi, . . . ,Xn,yi, ■ ■ ■ , denote the input gates of circuit Cm, m > 0, where 
X\ ' ' ' Xji — X and j/i • • -j/nfc = f{\x\). First canonically transform {C„i]m>o into 
a formula 93' over an extended vocabulary a that satisfies L{ip') = L (cf. [5, The- 
orem 9.1], [23, Theorem IX. 2.1] or [24, Theorem 4.73] for the construction of (f'). 
Let Z G N be such that n' is a size bound on {C„i}m>o- Then the transformation 
encodes gates as Z-tuples of variables over {l,...,n} and ensures the correct 
structure using additional predicate symbols Fred, InputO, InputI, Output, 
And, Or, Not and predicates QuanTq for the subset of oracle gates from Q 
used in Cm- For example, the relation Fred holds on a tuple {zi, . . . , Z21) iff 
the gate encoded by {zi, . . . , zi) is a predecessor of (^z+i, . . . , 22/): and each of 
remaining predicates holds on a tuple (zi, . . . , z;) iff (zi, . . . , z;) encodes a gate 
of the corresponding type. Note that each relation in a is definable in F0[<, 
as DLOGTIME C FO[Bit] C F0[<,/:'^]. 

Next, replace the relations InputO and InputI corresponding to the input 
gates Hi, 1 < i <n'', with the respective advice predicates from Definition 6.2, 

-iAdvice{ g(«,n,n'') and Advice{ n, n''); 

both of which are definable in C^'^ by assumption. In the resulting formula, replace 
the remaining predicates besides < and Bit by their defining FO-t-Q[<, £'^]- 
formulae and eventually obtain a formula defining L e FO-|-Q[<, jC^]. 

We can now give a lower bound beyond which the Uniformity Duality Prop- 
erty holds. Let NTIME(n)'^ denote the class of languages decidable in linear 
time by nondeterministic Turing machines with oracles from C 

Theorem 6.4. Let Q be any set of groupoidal quantifiers and suppose C = 
NTIME(n)^. Then the Uniformity Duality Property for ({3} U QX) holds. 

Proof. Choose any L e AC°[Q]n£ and let / be an advice function for which e 
DLOGTIME- imiform AC°[Q]. We have to show that the relation Advice{ q is 
definable in £^ . Let k €N such that = and denote by {Cm}m>o the 

DLOGTIME-uniform circuit family of size < that recognizes using oracle 
gates from Q. Let iV be a nondeterministic linear-time Turing machine deciding 
L using some oracle L' € £. 

We will define a nondeterministic Turing machine M that decides x e La^v, 
where i^Adv is such that i-Adv — ADVICE^^ g. Given input x, M proceeds as 
follows: 

1 if a; is not of the form x = [xi = i, X2 = n,Xz = for some 
n > and 1 <i <n 
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2 then reject; 

3 for all strings a of length n'^ in lexicographic ordering do 

4 t ^ true; 

5 for all inputs y of length n do 

6 if the output of C„+„i= on y with advice a contradicts the result of 

N OT\ y 

7 then t ^ false; 

8 if t = true and bit i of a is 1 then accept; 

9 else reject; 

That is, M guesses the advice string a using a nai've trial-and-crror approach; 
once the correct advice string a has been found, it accepts x iff the ith bit in the 
advice string a is on. Thus M decides LAdv 

As for the time required by M, note that in hnc 6 the circuit C„_|_„fc can 
be evaluated in time 0((n + n*^)^') = 0(n'') = 0(|a::|). Furthermore, C = 
NTIME(n)''' implies that £ is closed under complement. Thus the above al- 
gorithm solves the problem in NTIME(|a;|)™NTiME(|a;|)^ ^ NTIME(jx|)^ = C 
and ADVlCE{_g e C^l The claim now follows from Theorem 6.3, because 
C = NTlME(n)^ moreover implies Bit e C^. 

Corollary 6.5. Let Q be any set of groupoidal quantifiers. The Uniformity Dual- 
ity Property holds for ({3}uQ, £) iff, equals the deterministic context-sensitive 
languages DSPACE(n), the context-sensitive languages CSL, the rudimentary 
languages (i. e., the linear time hierarchy [26]), PH, PSPACE, or the recursively 
enumerable languages. 

Proof All of the above classes satisfy C = NTIME(n)'^. 
7 Conclusion 

For a set Q of quantifiers and a class C of languages, we have suggested that 
Q[arb] fl C defines an (cxtcnsionally) uniform complexity class. After defining 
the notion of £- numerical predicates, we have proposed comparing Q[arb] fl £ 
with its subclass Q[<,£'^] n£, a class equivalently defined as the (intensionally) 
uniform circuit class F0[<, £^]-uniform AC°[Q] fl C. 

We have noted that the duality property, defined to hold when both classes 
above are equal, encompasses Straubing's conjecture (1) as well as some positive 
and some negative instances of the Crane Beach Conjecture. 

We have then investigated the duality property in specific cases with Q = 
{3}. We have seen that the property fails for several classes £ involving the 
context-free languages. Exhibiting these failures has required new insights, such 
as characterizations of the context-free numerical predicates and a proof that the 
complement of the Immerman language is context-free, but these failures have 
prevented successfully tackling complexity classes such as AC^flCFL. Restricting 
the class of allowed relations on the left hand side of the uniformity duality 
property from arb to a subclass might lead to further insight and provide positive 
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examples of this modified duality property (and address, e.g., the class of context- 
free languages in different uniform versions of AC*^). Methods from embedded 
finite model theory should find applications here. 

More generally, the duality property widens our perspective on the relation- 
ship between uniform circuits and descriptive complexity beyond the level of NC^. 
We have noted for example that the property holds for any set of groupoidal quan- 
tifiers Q 3 {3} and complexity classes C that are closed under nondeterministic 
linear-time Turing reductions. 

A point often made is that a satisfactory uniformity definition should ap- 
ply comparable resource bounds to a circuit family and to its constructor. For 
instance, although P-uniform NC-*^ has merit [1], the classes AC'^-uniform NC^ 
and NC^-uniform NC^ [5] seem more fundamental, provided that one can make 
sense of the apparent circularity. As a by-product of our work, we might suggest 
F0[<,£^] n £ as the minimal "uniform subclass of C and thus as a meaning- 
ful (albeit restrictive) definition of C-uniforrn C. Our choice of F0[<] as the 
"bottom class of interest" is implicit in this definition and results in the contain- 
ment of C-uniform C in (non-uniform) AC° for any C. Progressively less uniform 
subclasses of C would be the classes Q[<, C^] fl £ for Q D {3}. 

Restating hard questions such as conjecture (1) in terms of a unifying prop- 
erty does not make these questions go away. But the duality property raises 
further questions. As an example, can the duality property for various (Q, C) be 
shown to hold or to fail when Q includes the majority quantifier? This could 
help develop incisive results concerning the class TC*' . To be more precise, let us 
consider Q = {3,MAJ}. The majority quantifier is a particular groupoidal (or, 
context-free) quantifier [16], hence it seems natural to consider the Uniformity 
Duality Property for ({3,MAJ}, CFL): 

FO-hMAJ[arb] n CFL = FO-hMAJ[<, -|-] n CFL. (6) 

It is not hard to see that the Immerman language in fact is in FO-t-MAJ[<, -f], 
hence our Theorem 5.2 that refutes (2), the Uniformity Duality Property for 
(FO, BC(CFL)), does not speak to whether (6) holds. (Another prominent ex- 
ample that refutes (2) is the "Wotschke language" W = {(a"b)" : n > 0}, again 
a co-context-free language [25]. Similar to the case of the Immerman language 
we observe that W e FO-(-MAJ[<, -h], hence W does not refute (6) either.) 

Observe that FO-t-MAJ[arb] = TC" [5] and that, on the other hand, 
FO+MAJ[<,+] = MAJ[<] = FO[+]-uniform linear fan-in TC° [15,6]. Let us 
call this latter class sTC° (for small TC° or strict TC°). It is known that 
sTC° C TC° [16]. Hence we conclude that if (6) holds, then in fact TC"nCFL = 
sTC° n CFL. Thus, if we can show that some language in the Boolean closure 
of the context-free languages is not in sTC", we have a new TC° lower bound. 
Thus, to separate TC" from a superclass it suffices to separate sTC" from a 
superclass, a possibly less demanding goal. This may be another reason to look 
for appropriate uniform classes C such that 

FO-FMAJ[arb] n £ = FO-FMAJ[<, +] n £. 
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